A Control Structure For A Question-Answering System
Abstract
The control structure of a question answering system is derived from a set of basic assumptions. For its design and implementation, a normal algorithmic language is used. The design leads to "symmetric" procedures; the resulting programs use the stack mechanism for recursion to give the correct storage allocation. Two strategies are discussed in more detail: a trivial, well-known depth-first strategy and a new one, called the flexible decision tree strategy. No linguistic aspects (either syntactic or semantic) of the system are discussed.
Introduction
The author and his colleagues have developed a prototype of a question answering system. A document describing the whole system and the considerations that have led to the design has not yet been published, but will appear before long. As this paper discusses the algorithmic aspects on a fairly abstract level, only a rough sketch of the system's properties is needed. This sketch is given in section 1. In order to be able to give some examples, section 1 also contains a superficial description of some properties of the universe of discourse of our system.
The reader should however realize that this paper is not, and is not intended to be, a description of the system. Some of the characteristics that seem to be common to all A.I. systems, like the existence of local ambiguities, are also present in our system. In contrast to what seems to be customary for the construction of A.I. programs, no list processing language (LISP or one of its improvements, PLANNER or CONNIVER) was used. In the opinion of the author, matters such as automatic backtracking and garbage collection should be kept in sight. They may be present only implicitly, as opposed to being programmed explicitly, but the programmer should control them, as he controls some storage allocation mechanism via invocation and termination of procedures, thereby implicitly controlling the size of the stack. One of the targets of the project was to reach a clean, well-structured program by means of a "top-down" design method. This paper claims to show how the control structure was derived (in an informal way) from a small number of basic assumptions. For the notation of the algorithms in section 6, no formally defined programming language is used.
Most of the symbols are borrowed from ALGOL68; the remaining ones are supposed to be self-explanatory. For design and publication purposes, algorithmic descriptions that are easy to read for human readers are preferred to algorithmic descriptions that can be processed by a compiler.
1. A rough sketch of the properties of the system
The system (called PHLIQA, for PHiLIps Question Answering) is able to answer questions formulated in a natural language (English), where the answers to these questions are facts, either to be found directly in a database inside the system or to be derived from information stored in the database. As in most present-day A.I. systems, the claim is not that PHLIQA can understand the full natural language; it is only capable of processing the questions relating to the restricted universe of discourse, as dictated by the contents of the database. However, the system is designed in such a way that a great part of the program is independent of the actual universe of discourse; this may become clear at the end of this section.
The process that answers a question is divided into three subprocesses: the interpretation of the question (resulting in an evaluable expression), the evaluation of that expression (resulting in the value of the answer) and the formulation of the answer. In the current paper, only the interpretation process is discussed. If the program is to be transparent and well structured, the interpretation process is too complicated to be performed in one giant step. Between the natural language and the level of the evaluable expressions, a number of intermediate language levels has been designed. The conversion from an expression at a certain language level to its equivalent at the next lower level is performed by program modules called converters. The general and common aspects of the control structure of those converters form the topic of this paper. The concrete definitions of the intermediate languages are not relevant to this discussion, and are not given. All these languages have one aspect in common: except for the uppermost level (the natural language), all expressions in these languages are represented as trees.
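The chain of converters described above can be illustrated with a minimal sketch. It is written in Python rather than the paper's ALGOL68-like notation, and it is not PHLIQA's actual design. The names `Tree`, `Converter`, `interpret`, `split_reading` and `drop_first_reading` are hypothetical, as is the assumption that a converter returns a list of candidate trees (several candidates standing for a local ambiguity, an empty list for a dead end). The recursion shows how a depth-first strategy can keep every pending alternative in a stack frame, so that storage is released when a procedure terminates.

```python
from dataclasses import dataclass
from typing import Callable, Iterator, List


@dataclass(frozen=True)
class Tree:
    """An expression at some language level, represented as a tree."""
    label: str
    children: tuple = ()


# Assumed interface: a converter maps a tree to the candidate trees
# at the next lower language level (hypothetical, for illustration only).
Converter = Callable[[Tree], List[Tree]]


def interpret(tree: Tree, converters: List[Converter]) -> Iterator[Tree]:
    """Depth-first search through the chain of converters.

    Each untried alternative is held by the loop in a recursion frame,
    so the recursion stack itself does the bookkeeping for backtracking.
    """
    if not converters:
        yield tree  # lowest level reached: an evaluable expression
        return
    first, rest = converters[0], converters[1:]
    for candidate in first(tree):
        yield from interpret(candidate, rest)


def split_reading(tree: Tree) -> List[Tree]:
    # Hypothetical converter: every input has two readings (ambiguity).
    return [Tree(tree.label + ":a"), Tree(tree.label + ":b")]


def drop_first_reading(tree: Tree) -> List[Tree]:
    # Hypothetical converter: reading "a" turns out to be a dead end.
    if tree.label.endswith(":a"):
        return []
    return [Tree(tree.label.upper())]


results = list(interpret(Tree("q"), [split_reading, drop_first_reading]))
print(results)
```

In this sketch the first reading is abandoned as soon as the second converter rejects it, and control returns to the loop in the enclosing call, which then tries the next reading. No explicit backtracking store is needed.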
All converters (with the trivial exception of the uppermost one) will have a tree as input and will produce a